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I thank Michael Larsen and Nathaniel Schenker 
for their thoughtful contributions, which usefully re- 
inforce and expand my arguments. 

Any differences between my perspective and that 
described by Schenker are minor. He defends a prag- 
matic approach to the Bayesian/frequentist divide, 
but in my applied work I confess I am pragmatic too. 
I provide confidence intervals and even P values to 
my biomedical collaborators, rather than posterior 
credibility intervals. I fear these would meet with 
head-scratching (not to mention article rejection) 
given current conventions of statistical reporting in 
medical journals. Like Schenker, I rely on similar- 
ities of frequentist and Bayesian interval estimates 
in many standard models. I am more focused on de- 
veloping good scientific models than on elicitation 
of prior distributions. 

Despite concessions to current-day realities, think- 
ing about and articulating the underlying principles 
that should guide our methodology is worthwhile. 
For example, in the survey sampling setting, I do 
not believe design-based inference is appropriate for 
some problems, and model-based inference is appro- 
priate for others. 

Larsen calls for more examples of how to achieve 
Bayesian calibration in practice. A very reasonable 
request, but complete textbooks are needed to do 
any kind of justice to that aim; my examples barely 
scratch the surface. Concerning his specific com- 
ments, I think Bayesian hierarchical models are huge- 
ly valuable; model checks seeking a good fit to the 
observed data are important, but unfortunately not 
sufficient to guarantee good predictions for missing 
data and target unknowns. A calibrated Bayes per- 
spective would I hope push the field toward more 
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research and training on how to develop good mod- 
els in practice. 

Schenker's theoretical and applied work on multi- 
ple imputation is influential, and the examples cited 
in his discussion add real substance to my mus- 
ings. Larsen's multiple imputation example allows 
me to illustrate Schenker's comment in his Section 
4.4 on the danger of omitting important variables 
in the imputation model. The relationship between 
parental longevity and offspring's diabetes was un- 
der study, and age of death was missing for parents 
currently alive. Larsen's imputation model condi- 
tioned on diabetes status of the offspring. This is 
important, since a multiple imputation model for 
this variable that failed to condition on offspring's 
diabetes status would lead to attenuation of the esti- 
mated relationship between these variables — indeed, 
it might well be worse than discarding the incom- 
plete cases, which would distort the distribution of 
parental age at death, but not necessarily the rela- 
tionship under study. 

Both discussants consider the Bayes/frequentist 
divide in the context of inference from survey sam- 
ples. As Schenker states, the debate is particularly 
lively in that area, given that the prevailing philoso- 
phy is to base inference on the randomization distri- 
bution that governs sample selection. My perspec- 
tive is described in Little (2004), but let me respond 
to some of Larsen's comments. The goal of design- 
based survey inference is "frequentist in nature," 
but for me (well-calibrated), Bayesian inference is 
just as useful and appropriate for inference about fi- 
nite population quantities as it is for model param- 
eters. The bald statement that "no model at all is 
involved in design-based inference" is oversimplified, 
since (as Larsen points out) design-based inference 
without any consideration of the implicit model un- 
derlying the choice of estimator leads to absurdities 
like Basu's (1971) famous elephant example. 

Terminology can be confusing, and Larsen's com- 
ment allows me to draw distinctions between my use 
of the term "calibration" and other uses. Deville and 
Sarndal (1992) discuss calibration of estimates to 
aggregate statistics. This form of data calibration is 
(in principle) automatically achieved by a Bayesian 
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model for prediction that incorporates this informa- 
tion, without the need for Deville and Sarndal's ad- 
hoc distance measure, though achieving it exactly 
may be challenging. Larsen also mentions "broader 
modeling options" in Sarndal, Swensson and Wret- 
man (1992). These model-assisted methods "cali- 
brate" model predictions with design-weighted ad- 
justments based on the model residuals. They lead 
to compromises between model predictions and di- 
rect design-based estimates, similar to the doubly 
robust competitors to the PPSP method in my Ex- 
ample 4. In my view, these approaches, which are 
ultimately design-based, are inferior to judicious ap- 
plication of the calibrated Bayes approach. For sim- 
ulation evidence in the context of my Example 4, 
see Kang and Schafer (2007) and Zhang and Little 
(2011). 

In my discussion of Hansen, Madow and Tepping 
(1983), I opined that the way to mitigate the effects 
of model misspecification is not to modify the esti- 
mator, as in these model assisted methods, but to 
modify the model. I have not changed my opinion. 
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